00-255 
1496.00039 

pT^/^QTTn ARCHITE / " irpTTPTC "SING FPGA 
™ pg tm SYSTEM * rwTP DESIGN 



Field of Tnvention 

rpiat-pq to a method and/or 
The present invention relates no 

architecture for a Field Provable Gate Array (FPGA, core 

generally and, .ore particularly, to a diagnostic method and/or 

architecture using an FPGA core in a system on a chip (SOC) design. 
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A number of conventional approaches to chip design 
include (i) directly connecting internal signals to chip I/O pins 
using multiple layers of multiplexers,- (ii) read^n^chip 
readable registers through host^nterfaces ; (ill) running resistor- 
tr^o7To7c (RTL) simulations to create special case^o^ind 
bU gs and verify a fixed solution; and (iv, using (^ators^to 

verify the design. 

The first conventional approach is limited by the numher 
of the chip I/O Pins that can be used to access, the internal 
signals. For example, with 2 layers of 8 to 1 multiplexers, up to 
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8 signals are needed to be probed simultaneously, totaling 3 + 3 + 8=14 
I/O Pins that are needed for the design. With chip gate counts 
over 1 million gates, good coverage is difficult to achieve with 

such an approach. ^ ^~ ^- n 

5 The second^conventionai^pproach can only capture) aUapj 

ChoJof the /chip readable regies. While such an approach U 
Vhelpful in\d^-f-ying^h-e^Is^ce of a bug,, it does not offer 
enough information to precisely locate th/bugJ Additionally, 
1 special effort is needed in the design phase Wganize all of the 

m 

1<M registers. 

The third conventional approach can access all the chip 
internal signals, but has drawbacks. First, such an approach is a 
softwareAsimulation approach that is very time consuming. The 
Clatiolpeed is usually thousands of times slower than the real 
i;r ^^ econd , since such an approach is a software simulation 
method, it cannot reflect all the factors in the real system. 
Third, some bugs may not be uncovered with the RTL simulation 
method because of the running time limitation or the difficulty in 
creating the simulation case. 
20 The fourth conventional approach can run at a speed up to 

1 MHz. However, the emulator is very expensive (i.e., about $1 per 
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gate) . implementing such an emulator is not easily accomplished. 
Additionally, the verification cost can be very high. 

therefore, it would be desirable to provide a system for 
)that would minimize the drawbacks associated with 
systems . 
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summary »f the In vention 

The present invention concerns a system fot designing an 



'integrated circui 



it (IC)\The system generally comprising^ circjjir 

a programme portlc^l used for ^™ st ' icS \^ indin 9 b ^ 
The circul^ene-r-all-^mprises (i) a Wtional potion and (A) 
a logic portion that may be connected toWfunctional portion. 
The logic portion generally includes one or more interfaces. The 
pQgrammabirXtion may be configured to detect, correct and/or 
diag^se ^errors in the logic portion through the one or more 
interfaces. 

The objects, features and advantages of the present 
invention include providing a method and/or architecture for 
implementing a diagnostic architecture usi^n P^^cor^ a 
system on-chip design that may (i) easUringing upyverif ication 
^and debugging by providing interconnectioW_p*ogramming options; 
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(ii) observe important signals while the chip is running under a 
formal/mode; (iii) run at a (singlyltep mode while under the 
^ntrol of the FPGA core; (iv) display appropriate signals on a 
debugging workstation, allowing many debugging features to be 
supported such as: (a) triggering and tracing based on internal 
signals, (b) dynamically jhangixg h ost register j ralues and (c) 
providing complex monitoring functions, since the FPGA is 

p,^m^ timS and/ ° r (Vi) 

improve product time to market. 



Brief Description e >f t-he Drawings 

These and other objects, features and advantages of the 
present invention will be apparent from the following detailed 
description and the appended claims and drawings in which: 

FIG. 1 is a block diagram of a preferred embodiment of 

the present invention; 

FIG. 2 is a more detailed block diagram of the present 

invention; 

FIG. 3 is an even more detailed block diagram of the 
present invention; 



00-255 
1496 . 00039 

FIG. 4 is a timing diagram illustrating an exemplary 

operation of the present invention; 

FIG. 5 is an example of an implementation of the present 

invention on a chip; and 

FIG. 6 is another implementation of the present 

invention . 
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r-^i^ n^nr-int-ion t-h.* Preferred Embodiments 

With the exponential increase of the number of 
transistors integrated on a single chip, diagnostics of the chip 
becomes more and more challenging. Better accessibility to 
internal signals of the chip is highly demanded. 

The present invention implements an FPGA core as an 
I Qedded) Field Programmable Gate Array core that may be used to 
^abi-e^ogic to be programmed after the ^^^^ff^*^ 
The FPGA core can be used to implem^ \n-^J^™s^s to 
enlbie debugging-factions, such as bL monitoring, probing, single 
Lp running, triggering, capturing, ^cT^ example of such an 
F pV*a/be foundVco-pending application Serial No. 09/464,741, 
which is hereby incorporated by reference in its entirety. 
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Referring to FIG. 1, a diagram of a system 100 is shown 
in accordance with the preferred embodiment of the present 
invention. The system 100 generally comprises an FPGA core and 
logic block (or circuit) 102 and a debugging workstation block (or 
circuit) 104. The circuit 102 may be implemented as a single chip 
or integrated circuit (IC) . Additionally, the debugging 
workstation 104 generally works in combination with the FPGA core 

and logic block 102 . 
| Thesystern_100^ene ^lly allows for_ design of an 

integrate^Sr^u^^ * 
Ctional portion thit varies with design and a logic portion 
con^ed_tP^ftinctional portion (to be discussed in connection 
with FIGS . 2 and 3) . The logic jportion generally includes on ejr 
1 more interfaces that may be coupled to the debugging circuit 104. 
15 ° ^d^gZ^u Tt 104 may be conf igured^etect^errors in the 
logic portion through the one or more interfaces. 
-^-^^r7ing to FIG. 2, a more detailed block diagram of the 
FPGA core and logic block 102 is shown. The FPGA core an^logic^ ^ 
block^l-02 generally comprises a register portion 110 
20 ^Tion 1^> register portion 114, an FPGA core portion lTS"and 
"TSSnt^T^ortion 118. The FPGA core 116 may be implemented to 
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control the chip 102. The register portion 110 generally comprises 
a number of register blocks 120a-120n. The register portion 114 
generally comprises a number of register blocks 122a-122n. The 
logic block 112 generally comprises a combinational logic block (or 
circuit) 130, an I/O block (or circuit) 132 and a multiplexer 
circuit 134. A particular implementation of the multiplexer 134 
may be varied in order to meet the criteria of a particular 



implementation. 

The register block 110 may communicate with the FPGA core 
116 through a bus 140 and a bus 142. Similarly ^the register b lock 
llT^aT^lso commu^^Tth the FP^T^rTTie through a bus 144 
and a bus 146. The buses 140, 142, 144 and 146 may be implemented, 
in one example, as multi-bit buses. However, the buses 140, 142, 
144 and 146 may also be implemented as single bit buses, if 
appropriate. Additionally, the buses 140, 142, 144 and 146 may 
also be implemented as bidirectional buses. 

The FPGA core 116 may also communicate with the control 
block 118 through a bus 148. The FPGA core 116 may communicate 
through a number of I/O pins (e.g., I/0_PINS) over a bus 150. The 
FPGA core 116 may also-communicate with the debugging workstation 
104 through a (bus 152.) The buses 148, 150 and 152 may be 
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implemented as single bit buses or multi-bit buses. Additionally, 
the buses 148, 150 and 152 may be implemented as bidirectional 
buses . 

The system 10 0 may provide a chip diagnostic^ 
(Lhitecture^y ^ementing theF ^coreJ.16. By using-t-he FPGA 
Ve^i^T^Ie^nTs^ch chio'Sn^sIils , simultaneous probing 
of internal signals can be achieved while the system 100 is running 
under predetermined conditi ons (e.g., a normal mode co operation) . 
Additionally, a process may be implemented to allow the FPGA core 
116 to collect / ^ N from the registers 120a-120n and 122a-122n 
using a Can chain J while the system^OO is "running under 
predetermi^-cand^ions (e.g., a (step mode configuration) 




controlled by the FPGA core 116. 

With the FPGA core 116 running at a speed much faster 
than a clock of the system 100, data collecting from the internal 
signals can be done while the system 100 is running in the normal 
mode. For example, in consumer products, devices typically only 
run at 27 MHz, while the FPGA core 116 can operate at as fast as 
200 MHz to 400 MHz, or even higher. The FPGA core 116 can be 
programed to simultaneously probe multiple internal signals that 
are connected to the I/O block 132 while the system 100 is running 



00-255 
1496 . 00039 



10P 
ffl 



15 



20 



at a normal operational speed. After the data is collected and 
compressed, the data will be sent to the debugging workstation 104. 
The wav eforms of the^n ternal si gnals^under probing can be 
displayed as if they are directly connected to a logic analyzer by 
internal wires (via the debugging workstation 104) . 

The FPGA core 116 may be implemented to support up to IK 
of I/O data. Therefore, up to IK of internal signal probing can be 
supported simply by directly connecting the signals to the l/0_PINS 
of the FPGA core 116. Multiplexing (e.g., through the multiplexer 
134) may also be implemented to increase an amount of I/O data. 
For example, if 2 layers of 8 to 1 multiplexers are used, up to 64K 
of internal signal probing may be supported. Additionally, 
multiplexers may allow the circuit 100 to drastically reduce the 
number of internal connection wires. The particular number of l/Os 
supported by the FPGA core 116 as well as the particular number of 
multiplexers may be varied to meet the design criteria of a 
particular implementation. 

By controlling the system clock, t he^FPGA ^or e ,116 may^ 

also be programedto^n^ system^ B ^ 
CTiSTng^nTCan c^y^T^T^t^^^^ core 
116 may collecN^e/f rom the regist er blocks 110 an d_114 
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implemented on the system 100. Since the scan chair/is usually 
very long (e.g., several thousands of registers per scan chain), 
the scan chain may be separated into small segments to speed up 

data collection. 

Referring to FIG. 3, a diagram illustrating the breaking 
of a scan chain into segments is shown. The scan— e-hain 
implementation of the circuit 102 generally comprises a multiplexer^ 
logic block (or circuit) 160 and a scan chain segment bio^k (or 
circuit) 162. The multiplexer logic block 160 generally comprises 
a number of multiplexers 164a-164n. T^can^ain segment block 
1 162 generally comprises a number of /ultiplexeri 166a-166n and a 
^ number of scan chain segments (e . g . /^SC^S^feO , SCAN_SEG1 ... 

3_i, 

5 SCAN_SEGX) . The scan chain segments may allow the FPGA core 116 
select and collect register data more quickly. 

FIG. 4 shows the timing relationship between a system 
clock signal (e.g., SYS_CLK) and a clock signal^..-g^IAG_CbK) . 
The clock signal DIAG_CLK may be implemented as L diagnostic cloc/ 
The FPGA core 116 may control the diagnostic clock signal_DIAG_CLK 
The system clock signal SYS_CLK may be implemented during the 
20 normal mode. The diagnostic clock signal DIAG_CLK may be 
implemented during the single step mode. Both the clock signal 
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SYS_CLK and the clock signal DIAG_CLK are derived from the FPGA 
core 116. 



The following, stepiN^ut line collecting of register data 
using the segmented Can chain/under single step mode: 

(i) by prV-ammirfg the FPGA core 116, the debug 
workstation 104 may increase a particular number of registers on 
the chip 100 that need to be observed as well as the observation 
start and stop times of the particular registers; 

(ii) during the ob^v^ibn start and stop time, the chip 
100 may operate under the s^igle^tep mode; 

(iii) the FPGA co>e^l6 generates the clock signal 
SYS_CLK, the clock signal DIAG_CLK and select signals DIAG_SEL10 
and DIAG_SEL11 (which may be multi-bit or single bit signals) to 
control which scan segments need to be accessed; 

(iv) under the single step mode, after each effective 
edge of the system clock SYS_CLK, the data in the selected scan 
segments SCAN_SEG0-SCAN_SEGX may be shifted to the FPGA core 116, 
then shifted back to the selected scan segment SCAN_ S EG0-SCAN_SEGX, 
and then the clock signal SYS_CLK may resume; and 
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(v) only the data.f-r.om the registers that needs to- be 
monitor/d may be stored andUompressed/ The I/O interface may then 
the data to the debuggi-ng^workstation 104. 
Referring to FIG . 5, one or more chip I/O pins 200a-200n 
are shown connected to ttyFPGA core 116 . Such a connection can 
allow the FPGA core 116 1 monitor Le I/O of the chip 102. In 
addition, one of the I/O pins__2^0a-200n can be connected as an 
input pin to the FPGA core 116. An internal module from the FPGA 
core 116 may generate ,f slg^t-o the chip during the chip 

debugging. Wi th a (system ^n chip disign, the signals among 
different (fSsTcan al^o^^^^ugh^he FPGA core 116. 
AdditionallV^e FPGA core 116 can b4idge ) signals between 
different modules 202a-202n. ^Such^ridging can help in debugging 
of different modules. Such\bridgin^ can also help in isolating 




20 



problems . 

For those modules 202a-202n that are sensitive to^the 
clock frequency, such as the refresh control block in an ^SRAM 
In^Te (not show^ ' the FPGA core 116 can keep these modules 
^ ning at no -r^Tc^ock speed. However, special attention needs to 
be paid during the design phase, to make the whole system work 
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normally while the system clock is slowed down during the step 
running mode. 

Computer Aided Design ((CAD) software may also need to be 
developed for the debugging workstation 104 . The debug^orkstation 
104 ca n be de veloped^ to work ^t ogether with the ( on-chip 1 FP.GA— eore 
116. The functions of ^the_debug,wo.r.kstation inclu^To'ne or more of 
the following: 




(i) f programing the FPGA core tj> implement user required 



(A) collecting data from the FPGA core though the I/O 




to the debugging 



interface, and 

(iii) offering an end luser 



One function of such software may be to offer an easy to 




15 ■ use interface to the user. 

Referring to FIG. 6, a work flow (or system) 300 of an 
example software is shown. The software will read in the probe 
name file, in which all the siiHMs that can be probed are logged. 
The ^netlis tjile and theUm, codeJof the design are read. All the 
20 registers on different sckn-segments are mapped to the netlist and 
related to the\ RTL c^dfes . In this way, the user can easily decide 

13 




00-255 
1496.00039 

which signals need to be observed during thedebugging period. 
While the chip lO^rs^runningy^Wmal speed/the software can 
display any of the\internal sAlV^a^e connected to^ the I/O 
of the FPGA core lieNtoi^rtte chip 102 is running at( single step 



mode, the software can display^ll the signals on the^chip^ . 
This is because under ^Tstep Le, the FPGA core 116 can be 
programmed to access aVof^ on-chip registers. All the 
combinational signals can be derived from the related regis.t-e-r 
values and the netlist^information. By using such software, 
cap-also (program the FPGA)core 116 to implement different! 
unction^Tlie^debusg^g workstation 104 and the CAD soft^can 
S^secfjin different projects, resulting in reduction of cost. 

The function performed by the system 300 of FIG. 6 may be 
implemented using a conventional general purpose digital computer 
programmed according to the teachings of the present specification, 
as will be apparent to those skilled in the relevant art(s). 
Appropriate software coding can readily be prepared by skilled 
programmers based on the teachings of the present disclosure, as 
will also be apparent to those skilled in the relevant art(s). 

The present /fnvent ion may also be implemented by the 
preparation / ASICs/ FPGAsX or by interconnecting an appropriate 
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ne 




twork of conventional component circuits, as is described herein, 
modifications of which will be readily apparent to those skilled in 
the art (s) . 

\ The present invention thus may also include 
product wiich may be a storage^edium including instructions which 
Led to program a (computer J to perform a process in 
ac^rdance with the present Ynvent'ion. The storage medium can 
include, but is not limited to, any type of disk including floppy 
disk, optical disk, CD-ROM, and magneto-optical disks, ROMs, RAMs, 
EPROMs, EEPROMs , Flash memory, magnetic or optical cards, or any 
type of media suitable for storixg-e±ect»»ic^nstructionB. ^ 

With thejQ^hip progr^mingjapability^of the FPGA core' 
116 and the cWg^ng~~^ol^ features can be 

implemented withou^^r^ingAe cost in diagnostics. Such may 
include one or more of the following: 

/the host 



(i) programing the FPGA core 116 
J N c 



register every N jclock cycles, 




(ii) S?apturing signals^everyjCjclock cycles, 

(iii) .dynamically changing\the host reaister values, 

(iv) ( single step tracing), (counters^ and (pointers' 



monitoring] 



13 

m 



00-255 
1496.00039 



15 



(v) searching for a speci^rc-signal^pattern, 

(vi) tracing an internal/ state machine jtriggered on a 



programmed condition, 

(vii) monitoring the correcthess^bf 



and/ or 

(viii) implementing 
performance (e.g., the active 





ounting to measure the 
request, the execution 



coverage of the internal Jtatejr^chines , etc.). 



^^gG^ can be implemented either 



20 



alone or in combination. For example, only particularly important^ 
signals may need to be connected to the FPGA core 116 formal time 
monitorJg. The present invention may^pro^ a FPGA core-^n 
VsiX a ^hitecture that eases chip Wing upj verification and 
Zjging by interconnection and programming options. The present 
invention may allow important signa 
while the chip is running under a 
~ ±nt ^ TTKrr ^ FPGA ^ore-: 1/6 

a^TIirtn^T^^ be displayed while the chip is 

running under a single step mode by allowing a FPGA core to control 
the chip. 
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can 



By programming the FPGA core 116, many debugging features 
be supported, such as one/oTn^e of theHEol lowing : 



(i) programmed triggering and tracing based oyifiEternal 
signals (e.g., tracing theN.nt.er^rial stkt^-machines (pointers, 

counters ,/ etc . ) 

(ii) triggering on the specific values of address, data, 

or command bus ; 

(iii) dynamically changing the host register values; 

and/or 



(iv) complex j (monitoring) functions (e.g., protocol 

monitoring) . V^c^naU^"^ SyStem 10 ° ^ 
debugging/verification time and product time to market. 
□ The FPGA core 116 may simultaneously probe multiple 

internal signals. By utilizing the scan chain under the single 
step mode and with the on-chip FPGA core 116 acting as the data 
process center, all the signals on the chip 102 can be observed. 
The FPGA core 116 can be used to_br^dge_the_sig^als between 
different modules, and the under test mode, to isolate a specific 
module and drive signals to test the specific module. The FPGA 
core 116 can also be used to add or verify bug fixes. The process 
of the debugging workstation 104 working with the on-chip FPGA core 
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116 to generate many powerful debugging features may also be 
implemented. 

The system 100 may provide functionality of a CAD 
software that may be implemented to work with the diagnostics 
design. The system 100 may allow for enhanced-^debugging 
capabilities with the diagnostics design, such as^ 4earchin£ for a 
^ecific jignal pa^tg^n, facing the internal st< 
i^geri^^^^ogra^ed j^ondition . The system^ 
^t^ ^jonitoring of the correctness of the\bus protocol/ doing 
statist is^ounting to mea^T^hTl^rf^^ testing 




yl coverage . 

T While the invention has been particularly shown and 

described with reference to the preferred embodiments thereof, it 
will be understood by those skilled in the art that various changes 
in form and details may be made without departing from the spirit 
and scope of the invention. 
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